Independence

A and B are independent, denoted P(AB), iff P(A|B)=P(A) or P(B|A)=P(B) or P(A,B)=P(A)P(B)

Conditional independence

X is conditionally independent of Y given Z if

P(X|Y,Z)=P(X|Z) or P(Y|X,Z)=P(Y,Z)or P(X,Y|Z)=P(X|Z)P(Y|Z)

Notion: P(XYZ)

Bayes' Rule

Product rule P(ab)=P(a|b)P(b)=P(b|a)P(a)

Bayesrule P(a|b)=P(b|a)P(a)P(b)

or in distribution form

P(Y|X)=P(X|Y)P(Y)P(X)=αP(X|Y)P(Y)

Useful for assessing diagnostic probability from causal probability:

P(Cause|Effect)=P(Effect|Cause)P(Cause)P(Effect)

An example of naive Bayes model:
Bayesian network representation-1777308741194.webp

P(Cause,Effect1,...,Effectn)=P(Cause)iP(Effecti|Cause)

Total number of parameters is linear in n

Bayesian network

A simple, graphical notation for conditional independence assertions and hence for compact specification of full joint distributions.

Syntax:

In the simplest case, conditional distribution represented as a conditional probability table (CPT) giving the distribution over Xi for each combination of parent values

Example: Bayesian network representation-1777308964529.webp

Global semantics

Global semantics defines the full joint distribution as the product of the local conditional distributions:

P(x1,,xn)=i=1nP(xi|parents(Xi))

active trail

Let Z be a subset of observed variables.
The trail Xi1XiXi+1 is active given Z if

direct separation(d-seperation)

Two sets of nodes X, Y are d-separated given Z if there is no active trail between any XX and YY given Z

To determine if X and Y are independent given Z:

A node is blocked if either the middle of an unmarked v-structure, or in Z (not both)

Local semantics

Local semantics: each node is conditionally independent of its nondescendants given its parents

Theorem: Local semantics ⇔ global semantics

Markov blanket

Each node is conditionally independent of all others given its Markov blanket: parents + children + children's parents